CDS

Accession Number TCMCG023C18020
gbkey CDS
Protein Id PIN09212.1
Location join(74116..74166,74242..74277,74356..74468,75323..75368,76864..76910,77189..77296,77942..78017,79527..79599,79701..79846,79935..80020,80956..81095,81172..81311,81395..81445,85494..85557,85711..85854,86035..86102,86231..86289,86984..87140,88941..88993,89085..89183,89962..90076,90148..90270,90761..90925)
Organism Handroanthus impetiginosus
locus_tag CDL12_18206

Protein

Length 719aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA324125, BioSample:SAMN05195323
db_source NKXS01003581.1
Definition Mismatch repair ATPase MSH4 (MutS family) [Handroanthus impetiginosus]
Locus_tag CDL12_18206
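
The Location and Length fields above can be checked against each other. The following is a minimal sketch, not part of the original record, that assumes the coordinates in the join(...) string are 1-based and inclusive and that the final codon of the CDS is a stop codon. The names LOCATION and exon_lengths are illustrative only.

```python
import re

# Location string copied verbatim from the CDS record above.
LOCATION = (
    "join(74116..74166,74242..74277,74356..74468,75323..75368,76864..76910,"
    "77189..77296,77942..78017,79527..79599,79701..79846,79935..80020,"
    "80956..81095,81172..81311,81395..81445,85494..85557,85711..85854,"
    "86035..86102,86231..86289,86984..87140,88941..88993,89085..89183,"
    "89962..90076,90148..90270,90761..90925)"
)


def exon_lengths(location):
    """Return the length of each start..end span (1-based, inclusive)."""
    spans = re.findall(r"(\d+)\.\.(\d+)", location)
    return [int(end) - int(start) + 1 for start, end in spans]


lengths = exon_lengths(LOCATION)
cds_length = sum(lengths)                  # total coding nucleotides
codons, remainder = divmod(cds_length, 3)  # remainder should be 0 for a complete CDS
protein_length = codons - 1                # assumption: last codon is a stop codon

print(f"segments={len(lengths)} cds_nt={cds_length} "
      f"remainder={remainder} protein_aa={protein_length}")
```

Under these assumptions the 23 joined segments total 2160 nt, that is 720 codons, which matches the stated protein length of 719 aa plus one stop codon.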

EGGNOG-MAPPER Annotation

COG_category L
Description MutS family domain IV
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000, ko03400
KEGG_ko ko:K08740
EC -
KEGG_Pathway -
GOs -

Sequence

CDS:  
ATGCTTGGAGTCTCTGAGCTGGTTGACAGATTTTGCTCTTTGGCCTCGAAGGTTGCACTGGGTCGAGGTTGCTTTGATGATACCAGGGGAGCTGTGCTAGTAAAAAATTTGGCGGCCAAGGAACCATCCGCTCTTGGTCTGGATACATATTACAAGCAATATTATCTCTGCTTGGCTGCAGCTGCTGCAACCATTAAGTGGATAGAAGCAGAGAAAGGGGTGATCATCACTAACCACTCATTAACCGTTACTTTTAATGGATCATTTGACCACATGAATATAGATGCTACTAGTGTCCAGAACTTGGAAATCATTGACCCAATGCACTCCGGTCTTTGGGGTACTAGCAACAAGAAGAGAAGTCTGTTTCACATGCTCAAGACAACACGAACTGTTGGAGGAACAAGACTTCTGCGAGCCAATCTTTTGCAGCCTCTTAAAGACATTGAGACAATCAATGCTAGGCTTGATTGTCTGGATGAGTTGATGAGCAATGAGCAACTGTTCTTTGGCTTGTCCCAGGCTCTTCGTAAGTTTCCAAAAGAAACTGATAAGGTCCTCTGTCACTTCTGCTTTAAGCAAAAGAAAGTTACTAATGGAGTCTTGGCTATTGACAATTCCAGAAAGAGCCAAATCTTGATATCAAGCATTATCCTTCTCAAAACAGCCCTAGATGCCTTACCATTACTCTCCAAGGTGCTCAAGGACGCCAACTGTTTTCTACTAAAAAATATTTACAAGTCCATATGTGAGAATGAAAAGTTTGCTTCTATGAGGACAAGGATTGGCGACGTGATAGATGAAGATGTTCTTCATGCTCGTGTTCCCTTTGTTGCTCGAACACAGCAGTGCTTTGCTGTAAAGGCAGGAATTGATGGACTTCTAGATATTGCACGGAGATCCTTTTGTGACACCAGTGAAGCAATACACAACTTAGCAAACAAGTACCGTGAGGATTTTAAGCTGCCAAATTTGAAAATCCCATACAACAACAGGCAAGGTTTTTACTTCAGCATACCTCAGAAGGACATACAGGGAAAACTTCCCAGCAAGTTCATCCAGGTCATGAAACATGGAAACAACATACATTGCTCTTCTCTGGAGCTGGCCTCTTTGAATATAAGGAACAAGTCGGCAGCTAAAGAGTGCTACGTCCGGACAGAATTTTGCCTGGAAGCACTAATGGATGCTATACGGGAGGATGTCTCTGTGCTCACACTTCTAGCGGAGGTCTTGTGTCTTCTTGATATGATAGTTAATTCATTTGCTCATACAATATCCACAAAGCCAGTTGACAAATTTACTAGACCTCAATTCACATATGATGGTCCGTTGGCAATTGATTCAGGACGACACCCCATCCTTGAAAGTGTACACAGTGAGTTTATTGCCAACAACATTTTTCTTTCTGAAGCATCAAATATGGTAATTGTGACGGGCCCAAACATGAGCGGAAAGAGTACTTATCTTCAGCTAGTTTGCCTGGTGGTCATCCTTGCCCAAATTGGTTGTTATGTTCCCGCGCGCTTTGCAACTTTGAGAGTAGTGGATCGCATATTTACTAGAATGGGAACTATGGACAGCGTTGAATCAAATTCTAGCACATTTATGACAGAGATGAAAGAAACGGCTTTTATCTTGCAAAACGCTTCTCACAGAAGTCTGATTGTTGTGGATGAATTGGGGAGAGCAACATCTTCCTCTGATGGGTTTGCAATTGCGTGGAGCTGCTGCGAGCATCTACTGGCTTTAAGAGCGTATACTGTATTTGCTACTCACATGGAAAACCTATCTGAATTGGCCACCGTTTATCCAAATGTGAAAATTGTTCACTTCGACGTTGAGATTAAGAATAAGCGCATGGATTTCAAGTTTCAACTGAAAGATGGACCGCGGCATGTAGCACAGTATGGCCTCATGCTAGCAGGAGTAGCTGGAATACCAAATCCTGTGATAGAGTCCGCCAAAAGCATCACATCAAAGATTACTGAAAAGGAAGT
GAAGAGAATAAAGATAAATTTCCAACAGTACGAAGATCTTCAAAGGGCTTACCGTGTTGCTCAGCGACTGACGTGTCTGAAATACTCCAACCAAGATGAAGATTCTATTCGCCAAGCTTTACAGATTCTTAAAGAAAGTTGTATCAATGGCGGGCTCTAA
Protein:  
MLGVSELVDRFCSLASKVALGRGCFDDTRGAVLVKNLAAKEPSALGLDTYYKQYYLCLAAAAATIKWIEAEKGVIITNHSLTVTFNGSFDHMNIDATSVQNLEIIDPMHSGLWGTSNKKRSLFHMLKTTRTVGGTRLLRANLLQPLKDIETINARLDCLDELMSNEQLFFGLSQALRKFPKETDKVLCHFCFKQKKVTNGVLAIDNSRKSQILISSIILLKTALDALPLLSKVLKDANCFLLKNIYKSICENEKFASMRTRIGDVIDEDVLHARVPFVARTQQCFAVKAGIDGLLDIARRSFCDTSEAIHNLANKYREDFKLPNLKIPYNNRQGFYFSIPQKDIQGKLPSKFIQVMKHGNNIHCSSLELASLNIRNKSAAKECYVRTEFCLEALMDAIREDVSVLTLLAEVLCLLDMIVNSFAHTISTKPVDKFTRPQFTYDGPLAIDSGRHPILESVHSEFIANNIFLSEASNMVIVTGPNMSGKSTYLQLVCLVVILAQIGCYVPARFATLRVVDRIFTRMGTMDSVESNSSTFMTEMKETAFILQNASHRSLIVVDELGRATSSSDGFAIAWSCCEHLLALRAYTVFATHMENLSELATVYPNVKIVHFDVEIKNKRMDFKFQLKDGPRHVAQYGLMLAGVAGIPNPVIESAKSITSKITEKEVKRIKINFQQYEDLQRAYRVAQRLTCLKYSNQDEDSIRQALQILKESCINGGL